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Abstract. We consider records and sequences of records drawn from discrete time 
(]} ' series of the form X n = Y n + cn, where the Y n are independent and identically 

^ ■ distributed random variables and c is a constant drift. For very small and very large 

drift velocities, we investigate the asymptotic behavior of the probability p n (c) of a 
record occurring in the nth step and the probability Pn (c) that all N entries are 
CZ3 . records, i.e. that X\ < X2 < ... < X^. Our work is motivated by the analysis of 

temperature time series in climatology, and by the study of mutational pathways in 
evolutionary biology. 
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1. Introduction 

A record is an entry in a discrete time series that is larger (upper record) or smaller 
(lower record) than all previous entries. In this sense, a record is an extreme value that 
is defined relative to all previous values in the time series. Record events are of interest 
in various areas of life and science such as climatology [U El El SI E] and sports [6j [7] , 
but also in biology [8j [9j [10] . A record is usually a rare and remarkable event that will 
be remembered by observers. Not without good reason the term record originates from 
the Latin verb recordari - to recall, to remind. 

The classic results for records drawn from series of independent and identically 
distributed (i.i.d.) random variables (RV's) are well established, see [TTJ [121 1131 EE] for 
review. In this work we concentrate on two important quantities in particular. The 
first one is the probability for a certain entry in a time series to be a record, and the 
second one is the probability that the entries of a time series are ordered, or in other 
words, that all events are records. For i.i.d. RV's both these quantities are completely 
universal for all continuous probability density functions. This can be shown by the so 
called stick- shuffling argument: The last one of n identically distributed entries (sticks) 
in a time series is equally likely to be a record as all other entries, and therefore the 
probability p n for the nth event to be a record, henceforth referred to as the record rate, 
is given by 

Pn = (1) 

n 

Accordingly the expected mean number of records R n up to a time n can be obtained 
by computing the harmonic sum: R n = X^LiV^ ~ In (n) + 7 + 0(l/n), where 
7 ~ 0.577215... is the Euler-Mascheroni constant. From similar considerations one 
obtains the statistics of waiting times between record breaking events which turn out 
to be universal as well. It is equally straightforward to compute the probability for all 
events in a series of length N to be ordered in size. Since this case is only one of N\ 
possible and equally likely permutations of all N events, the ordering probability Pn is 
given by 

p « = w (2) 

We conclude that the two quantities of interest are related by 

N 

Pn = Y[pu, (3) 

n=l 

which reflects the fact that record events are independent in the i.i.d. case [HI IT3] . 
We will return to this point below in section 2. In contrast to the properties of 
record times, the distributions of record values are not completely universal, but their 
asymptotic behavior falls into three different universality classes that are analogous to 
the universality classes of extreme value statistics: The Weibull class of distributions 
with finite support, the Gumbel class of distributions with exponential-like tails, and 
the Frechet class of power law tailed distributions [H)J [HI dZ] . 
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Given that the statistics of records for i.i.d. RV's is well understood, it is natural 
to ask what happens when the underlying time series is correlated, or when the RV's 
are drawn from a distribution that varies in time. An important example of a correlated 
random process is the random walk, and the record statistics of this process was recently 
analyzed by Majumdar and Ziff [HI [19]. The simplest realization of a time-dependent 
distribution is the linear drift model (LDM) first considered by Ballerini and Resnick 
[20| I2T] . In this model the nth entry in the time series is of the form 

X n = Y n + cn, (4) 

where c is a constant and the Y n are i.i.d. RV's. In this simple scenario the probability 
density f n (x) of X n is of the form /„ (x) = f (x — cn) with a fixed probability density 
f (y) and the corresponding cumulative distribution function F(y) = dy' f(y'), 
which is the distribution of the i.i.d. part Y n of X n . We will usually consider upper 
records and assume c > 0. 

The LDM was originally introduced as a model for sports records in improving 
populations [2D] , and it has recently appeared in the context of the dynamics of 
elastic manifolds in random media [22J. An important motivation for the present 
work comes from the interest in the consequences of global warming for the occurrence 
of temperature records [31 Hj. In j5j [23] the effect of warming on daily temperature 
measurements was modeled using a Gaussian probability density with a linear trend, 
and it was shown that this very simple model is capable of quantitatively describing the 
statistics of record-breaking temperatures at European and American weather stations. 
In the climate context the drift speed c is typically small compared to the standard 
deviation of /, which suggests to consider the behavior of the record rate p n (c) for 
small c and finite n. This approach is complementary to previous work on the LDM 
[201 I2T1 1221 124"] . which has mostly been concerned with the asymptotic behavior of the 
record rate for n — > 00. 

Another application that motivates our research comes from the study of adaptive 
paths in evolutionary biology. In this context, a path is a collection of mutations that 
change the genotype of an organism into another genotype of higher fitness. Given that 
mutation rates are small, the evolution of a population usually proceeds one mutation 
at a time. For a given set of iV mutations, there are then N\ distinct paths which 
correspond to the different orders in which the mutations can occur. Since a mutation 
spreads in the population only if it confers a fitness advantage, a given pathway is 
accessible to adaptive evolution only if the fitness values of the intermediate genotypes 
increase monotonically along the path, that is, if they are arranged in ascending order 
[251 [26]. 

In view of the complexity of real fitness landscapes, the intricate interactions 
between different mutations are often modeled by assigning fitness values at random 
to genotypes [ID] . One such model, which is closely related to the LDM, was introduced 
by Aita et al. in the context of protein evolution [27]. In this model the fitness X n of 
a particular intermediate genotype with n mutations is assumed to consist of an i.i.d. 
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RV Y n and a systematic part cn, where c > if the mutations move the population 
closer to the global fitness peak, and the value of c (relative to the standard deviation 
of the Y n ) can be adjusted to tune the ruggedness of the fitness landscape. Taking 
into account also the initial genotype with no mutations, a total of N + 1 genotypes 
with fitness values Xq, X\, X^ are encountered along a path. The probability for 
a path to be accessible in this model is then just P N+ i(c), and the expected number 
of accessible paths is N\P N+1 (c). An immediate corollary of (T5]) is that the expected 
number of accessible paths of length N in a completely random fitness landscape without 
any average uphill slope (c = 0) is l/(N + 1) [281 129] - 

Here we consider both the record rate p n and the ordering probability P/v for the 
linear drift model. We distinguish between a small drift c that is much smaller than 
the characteristic width of the distribution (in most cases the standard deviation), 
and a large drift that is much larger than this width. Both cases are of practical 
relevance. In section 2 we discuss the general properties of record statistics for systems 
with linear drift, with particular emphasis on the correlations between record events. 
In the subsequent section 3 we will present new results for small c. We examine the 
record statistics for members of the three extreme-value classes individually and find 
the corresponding asymptotic behaviors. In section 4 we analyze the case of large c. 
Throughout Monte-Carlo simulations are used to confirm the analytical results. Finally, 
in section 5 we present a brief summary, discuss related issues and give an outlook on 
further possible research. Some of the calculational details are relegated to Appendices. 



2. General theory and an exactly solvable example 

The values taken by the {^i}ie{i,...,n} are stochastically independent. The probability 
that all n values are less than a given value x factorizes to niLi I- ^ x ifi ( x i) = 
n™ =1 Fj(x). Here fi and F { are, as stated in the introduction, the probability densities 
and cumulative distribution functions of the X{. Thus, given the value y n of the i.i.d. 
part Y n of X n , the probability that all previous RV's {Xj}^^ n _ 1 | are smaller than 

X n is Yli=i Jl" l ° d-Vn-ifiyn-i)- The probability that a RV X n drawn from a general 
time-dependent distribution F n (x) is a record is therefore given by [30] 

n-1 



X n = max {Xi} 

i€{l,—,n} 



/oo " i 

dx/ n (x) Y\ Fi 
■°° i=i 



Pn= dxf n (x) TT Fi (x) , (5) 



P 

which reduces to 

/co n—1 
dxf(x) F(x + ci) (6) 
-°° i=X 

for the LDM. It was shown in [20] that the limiting record rate 

/oo °° 
dxf(x) TT F(x + ci) (7) 
-°° i=i 
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exists and is nonzero for c > provided the distribution f(y) of the i.i.d. part in fll]) has 
a finite first moment. For c = 0, ([6]) can be evaluated directly and, with the substitution 
u = F(x), one obtains 

poo pF(oo)=1 1 

p n (c = 0) = / dxf(x)F(x) n - 1 = / duu 71 - 1 = -, (8) 



F(-oo)=0 



n 



independent of F, as already shown in ([I]). 

The other quantity under consideration in this article, the ordering probability P^, 
can be expressed as 

/oo poo 
dx N f N (x N )... dxi/i(xi)l xl<:E2 ... <:cjv . (9) 
oo J —oo 

Inserting f n (x) = f{x — cn) the indicator function t xl <x 2 <...<x N can be absorbed in the 
integral boundaries to yield 

poo ryN+c rV2+c 

Pn(c)= dy N f(y N ) dy N ^... d Vl f{ yi ). (10) 



As for p n (c), this equation can be solved for arbitrary F only in the case c = 0. Using 
again the substitution u = F(x) in turn in all the N integrals, starting from the inside, 
one obtains the result already derived in (J2j), 

dy N f(y N ) / dy N -.! ... j dy 2 f(y2)F(y 2 ) 

■oo J —oo J —oo 

oo ry rF{y 3 ) 

dynfiVN) I dy N _i ... j duu 

-oo J —oo J —oo 

n / &VNf{yN) j dyjv-l • • • / duu 2 = ... = -r^-.(ll) 



— oo 



The reason for re-deriving the two previous results is that here this is done in a way 
that in principle generalizes to arbitrary c. 

For c > 0, the exact evaluation of Eqs.Q and (flUj) has proven difficult, but in the 
case where the Y n are Gumbel distributed, i.e. F(y) = exp (— e~ y ), one can use the fact 
that this distribution obeys the relation F(y + a) = F(y) exp( - -a ) to explicitly perform 
the integration in ([6|) j20l [21]. With the abbreviation a = e~ c and the substitution 
u = F(y) one obtains 

poo /n— 1 \ . _ c 

Pn(c) = / dyMFM^ * = TV = —— (12) 




by use of the incomplete geometric series. Keeping c fixed, one obtains in the limit 
n — > oo the asymptotic record rate p(c) = 1 — e~ c , while for c — >• one recovers the i.i.d. 
result p n = 1/n. For c < the record rate is seen to decay exponentially in n, which 
implies that the expected number of records R n remains finite for n — > oo. We suspect 
this to be a general feature of the LDM with c < 0, but are not aware of a proof of this 
fact. 
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The relation used to evaluate (TT2"j) for the Gumbel case can also be used in ( TTU|) , as 
before starting from the innermost integral, which yields 

dy N f(yN) / dy N -x ... j dy 2 f{y2)F{y 2 + c) 

-oo J —oo J— oo 

coo PVN+c rF{x3+c) 

dy N f(yN) / d-UN-i ... duu a 

-oo J —oo J— oo 

i poo pXN+c i-F(xn+c) 

— / cWfo*) / dy^_! . . . / dim"* * 1 ) 



^ — OO t/ — oo ./ — oo 

N-l 



II ^r^- ( 13 ) 
1=1 2^k=o a 

Summing the geometric series as in ffl2l . one obtains 

P N (c) = (1 - e-)* = (l - e-) N Z N , (14) 

rin=i c 1 - e ) 

where i?jv is the grand canonical partition function of a system of bosonic particles with 
energy levels n — 1, JV at inverse temperature c. This partition function also occurs 
as one limit in the integer partition problem (see J3TJ 32J and references therein). 

The product Yln=i (1 — exp(— cn)) in the denominator is the so-called q- 
Pochhammer symbol (q\ q) n with q = e~ c . In the limit iV — > oo with fixed c, one 
has the asymptotic expression [33] 

iinc - «-> - <o- « Vv exp (-s + k) • < 15) 

n=l 

and thus, by inserting this into (JT 



Pat(c) « ^exp (iVln(l - e~ c ) + ?L-±}, N » 1. (16) 

On the other hand, taking c > 1 at fixed iV, one has a = exp(— c) <C 1 and thus 
the geometric series in the denominator of (Tl3|) can be approximated to first order in 
a = exp(— c), as l/(X^I=o Q;fe ) ~ 1 — « + 0(a 2 ). Then ( TI3"]) becomes 

Pjv(c) wexp(-(JV-l)a) = exp (-(iV - l)e~ c ) , c> 1. (17) 

This expression is distinguishable from numerical data only in the region of c ~ 
see figure [TJ 

Comparing the exact expressions Eqs. (fT3|) and ( !T2|) . one sees that the relation (J3J) 
obtained in the i.i.d. case remains valid here. This is a consequence of the mutual 
stochastic independence of record events in the LDM with Gumbel-distributed i.i.d. 
part [TU [121 121] • In fact the Gumbel distribution is uniquely characterized by the 
mutual independence of record values and record indicator variables (which indicate 
whether or not a record occurs at time n) [2U 134"] . 

For c > and arbitrary distribution F, however, the record events in the LDM 
are not independent. Numerical studies for several different distributions presented in 
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Figure 1. Comparison of the exact expression (|13|) and the asymptotic expression 
(|17p to numerical simulation. The exact expression is confirmed, and while there is 
a clear difference between simulations and asymptotic expression for small values of c 
in a), the approximation holds with good accuracy for large c (inset of b), lines are 
the asymptotic expressions). The main plot of b) demonstrates the scaling between 
N and c according to (fTT|) . 
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Figure 2. Joint probability of two consecutive record events at times n = 6 and 7, 
divided by the product of the corresponding record rates. This ratio is unity if record 
events are stochastically independent. For c = 0, this is the case, just as for Gumbel- 
distributcd i.i.d. parts (crosses). Note that for other probability densities, the record 
events also seem to become increasingly independent as c grows. 



figure [2] show that the records are negatively correlated and seem to repel each other. 
A more thorough examination of the structure of correlations between record events in 
this model is currently ongoing research [35]. For the purpose of the present discussion 
we merely note that record events appear to become asymptotically uncorrelated for 
large c. This fact will be used to derive some asymptotic results for Pn(c) in section HJ 
First however we consider the case c < 1. 
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3. Record statistics for small drift 

3.1. Record rate 

In the previous section we gave a general expression for the record rate p n (c) of the 
LDM. Here, we derive the first order term in a series expansion for c<l. If c is very 
small can be simplified as follows: 

n-l 

' Cl) 



/OO j; 
dyf(y)l[F(y + 

n-l 

dyf(y)H[F(y) + cif (y)} 
> i=i 

« dyf (y) F n ~ l (y) + C U (n " 1} f° dy/ 2 (y) F"" 2 (y) 

J —oo J —oo 

= - + d n (18) 

n 

4 - r dy/ 2 (y) F"- 2 (y) . (19) 



with 



This expansion is valid provided / (y) is slowly varying between y and y + ci, which 
strictly speaking requires nc to be small compared to the width of the distribution. 
In the following we will evaluate the first order correction coefficient /„ for several 
elementary distributions. 

Before doing this, we show that our formula for p n (c) can be generalized with 
respect to the position of the record in the time-series. Specifically, we consider the 
probability that the kth event in a time-series of length n with linear drift c is a record. 
For this purpose we have to consider the following integral instead of (jSJ): 

/oo n—1 />j/-|-c(i — fc) 

dyfn (y) J] / *Vifi(Vi)- (2°) 

Evaluating this integral in the same way as shown above, we obtain the following 
expression: 

P[X k = max(X 1 ,...,X„)] w - + | (k 2 - k - (n - k) (n - k - 1)) x 



x 



/oo 
dyf 2 (y) F n ~ 2 (y) . (21) 
-oo 



Note that for k = n this expression reduces to our approximation (jTBl for p n (c). 
Apparently for c > this expression assumes its maximum for k = n and its minimum 
for k — 1. The last entry has the largest, and the first entry the smallest chance to be 
the maximum of the series. 
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3.1.1. Weibull class. Let us start by considering the Weibull class of extreme value 
statistics, which contains distributions with finite support. A simple example for a 
member of the Weibull class is a uniform distribution, which takes the value 77- between 
—a and a and outside of this interval. For this case the first order expansion of p n (c) 
is given by 

^ - ^ ^ /: * (a \iat +o <*> • <22 » 

which can be evaluated to yield 

1 n — 1 

p uniform {c) ^_ +c __ (23) 

In this case the correction coefficient I n increases linearly with the number of events n. 
More generally, we consider distributions of the form 

f(y)=ai-yf~ 1 (24) 

with £ > and < y < 1. For these distributions we have 

Pn (c) « 1 + c^^l ^ dye (i - yf- 2 (1 - (i - yf) 1 " • (25) 

The integral is divergent for £ < 1/2, which indicates that p n (c) is a non-analytic 
function of c; this case will be considered elsewhere. For £ > 1/2 we use the substitution 
(1 — y) = z 1 ^ to express the integral in terms of a Beta-function, 

,n 1 ^(n-l) r ( 2 -|)r(n-l) 

^ c « - + c£ 1 J V K) . (26) 

n 2 r(n + l-| 



Using the Stirling approximation for large n one finally arrives at 

pn(c) ^ + f r ( 2- £H' (27) 

which shows that I n generally increases as a power law in the Weibull class. 

3.1.2. Frechet class. As a representative of the Frechet class of extreme value statistics 
we consider a general power-law distribution of the form / (x) = (l//x) x^^ 1 for x > 1 
and /x > 0. For distributions of this kind p n (c) in the small c expansion is given by 

Pn (c) « - + <££Ll^ r dy/i 2 !/" 2 -^ (1 - y-") . (28) 
n I Ji 

Again, the integral is very similar to a Beta-function and it can be transformed into one 
by elementary means. Doing this we find 

^ 1 n(n-l) r (2 + £)r>-l) 
p n (c) « - + c/i 1 - ; V , My , (29) 



n 2 



r ( n +^ + 1 ) 



and using again the Stirling approximation we obtain 

+ f r ( 2 + -)4s;- ( 3 °) 
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Figure 3. Results of Monte-Carlo simulations of the LDM for power law tailed 
distributions of the Frechet class. The figure shows the difference between the record 
rate in the time-independent case for c = and the drifting case with drift c = 0.01. 
This difference is given by - (p n (c) — p n (0)). The dots correspond to simulations with 
different tail coefficients fi = 1,2,3,5 averaged over 10 6 runs, and the lines show the 
analytic predictions. The first order approximation is very good for /j, = 1 and \i = 2 
but becomes less accurate for larger \x. 

In figure [3] we compare this prediction to simulation results. 

While in the case of the Weibull class the correction term I n increases with n, here 
it decays as a power-law vT 1 ^. For /x > 1 the decay is slower than the 1/n-decay of 
the record rate in the absence of a drift, which implies that the drift will nevertheless 
dominate the behavior for long times. This is consistent with the fact that the record 
rate reaches a nonzero asymptotic limit, as given by ([7]), because \i > 1 implies a finite 
first moment for the Y n . On the other hand, for /i < 1 the decay of I n is faster 1/n and 
the limit on the right hand side of ([7]) vanishes for any c, which implies that the drift is 
asymptotically irrelevant. The borderline situation /j = 1 has been studied by De Haan 
and Verkade [36], who find that the asymptotics depends nontrivially on the value of c 
in this case. 

In general the results presented so far show that the effect of the drift on a broad 
distribution is smaller than on a more narrow distribution. A similar qualitative trend 
was found in [30] for probability densities with increasing variance. 
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3.1.3. Gumbel class. The Gumbel class comprises unbounded distributions that decay 
faster than any power-law. A very simple representative of the Gumbel class is the 
exponential distribution f (y) = v~ x e~^ . In this case the first order expansion ( |T8l) 
assumes the following form: 

exvi \ 1 n(n-l) f°° 1 _2y / _y\ n ~ 2 .... 
(c) « - + c-h: -/ dy— e » 1-e A . (31) 

The integral can be solved by two partial integrations and one finds 

«>V^ < 32 > 

that is, the correction term is independent of n. 

The calculation for the Gaussian distribution, arguably the most important member 
of the Gumbel class, is more complicated. For convenience we consider a Gaussian 
distribution of unit variance, 

/( y ) = -Le-£. (33) 



'2k 

The integral of interest reads 

n(n — r 



y /2 

-v 



ir ss = / dye^ ( / dy'e-^r- ) . (34) 



n-2 



-00 \J —00 

which will be evaluated for large n using the saddle point approximation. With the 
definition 

9 (y) := -Z/ 2 + (n - 2) In (-^= J" dy'e^ (35) 

we have 



4 4vr \/d^(^) e ' (36) 

where y denotes the saddle-point of the integral. It turns out that the computation of a 
practicable series-expansion of g (y) can only be done under some approximations and 
by using the non-elementary Lambert-W function [371 EH]- In terms of the W-function 
W (z) defined by the relation W(z)e^^ = z, we find 



y 



\ 



(n-2" 2 



W ^— -!- . (37) 



in 



For large z the Lambert-W function can be approximated by W (z) ~ In (z) — In (In (z)), 
which eventually yields 




P 9 n USS (c)^- + c^^\n[-). (38) 

For a detailed derivation of this result see APPENDIX I. In figure H] the asymptotic 
prediction is compared to numerical simulations. The systematic deviations that are 
visible in this figure can be attributed to strong sub- leading corrections to ( 138|) . see 
APPENDIX I. 
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Figure 4. Results of Monte-Carlo simulations from 10 9 realizations of the LBM with 
RV's drawn from a normal distribution with standard deviation a — 1. The figure 
shows the normalized difference between the record rate in the time-independent case 
and the drifting case, - (p n (c) — p n (0)). The dots correspond to a simulation with 
drift velocity c = 10 -4 . 



As a more general subset of the Gumbel class we also considered distributions of 
the form / (y) = C^e~^ with f3 > and normalization constant Cp = [2T(1 + 
The integral of interest then reads 

4 = roller, £ d ^ %l " (/l 4 ^" 1 " 1 *)"" (39) 

which can again be treated using a saddle-point approximation. Ignoring constant 
prefactors we find that 

I n oc ln(n) 1 "? (40) 

for large n, which includes the results for the exponential distribution (/? = 1) and 
the Gaussian ((3 = 2) as special cases. For a detailed derivation of this result see 
APPENDIX II. We conclude that the behavior of the correction coefficient I n in the 
Gumbel class is generally intermediate between the power law growth for distributions 
in the Weibull class, and the power law decay for Frechet-type distributions. Again, the 
effect of the drift is stronger for distributions that fall off more rapidly (large 0). 

3.1.4- Relation to the asymptotic record rate p(c). It is instructive to compare the 
asymptotics of the correction term I n derived in the preceding subsections to the 
behavior of the limiting record rate p(c) for small c, which was studied by Le Doussal 
and Wiese [22J. Heuristically, the two quantities can be related as follows. We have 
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seen above that, for any choice of f(y) with a finite first moment, the correction term 
I n becomes large compared to 1/n for large n. This implies that, for any c > 0, the 
first order correction will eventually become comparable to the zero'th order record rate 
1/n. The corresponding time scale n* can be estimated from 

n*J n . ~ c. (41) 

For times n > n* the first-order expansion breaks down and the record rate saturates at a 
nonzero limiting value p(c). Thus we expect that, in order of magnitude, p(c) ~ l/n*(c). 
Using the asymptotic results ( |271l30f40l) together with (j4"Tj) we may then determine the 
behavior of p(c) for small c. The result 

Weibull 



p(c) 



c 



lnc |l-l//3 



Frechet with \x > 1 
Gumbel 



(42) 



agrees with the analysis of [22J in all cases 



3.2. Ordering probability 

In this subsection, we derive a first order expansion for the ordering probability Pjv(c) 
Our main result reads 



^ (c) = M +c (A^yi 



dxf{x) + 0{c 2 ). 



(43) 



In contrast to the expansion ffl8l) for the record rate, one sees that for P/v(c) the 
distribution f(x) only enters in the form of a non-universal constant but has no influence 
on the iV-dependence of the correction term. Note, however, that similar to the 
expansion for p n (c), the correction term diverges when f 2 (x) becomes too singular, 
as is the case for the Weibull-type distribution (|2~4|) with £ < 1/2. 

To prove (j4"3"|) . we set up a Taylor expansion of ffTUj) in c to first order. With 
P N (0) = 1/M we have 



c=0 



1 f°° d 
+ 0(c 2 ) + dx N f(x N ) -P(N - 1, c, x N ) 



c=0 



where terms of 0(c 2 ) and higher have been omitted and 



P(N-l,c,x N ) 



x N +c PVN-i+c 

d-VN-ifiVN-i) / dy N - 2 -- 

-oo J — oo 

XN+C 

dyN-if(yN-i)P(N - 2, c, y N -i). 

■oo 



V2+C 



Clearly, the derivative of P(N — l,c,x^) obeys the recursion relation 



—P(N-l,c,x N ) 
ac 



f{x N )P{N - 2,0, x N ) 



c=0 



/X N + C J 
dVN-ifiVN-i) ^P(N - 2, c, y N . 



c=0 
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Simulation, N=7 □ n 
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Figure 5. Simulations comparing the ordering probability Pn(c) to the first order 
expansion Pn{c) = l/Nl + df/(N — 2)!, where // = f dy f(y) 2 , for a) Gaussian 
distribution and N = 7, b) uniform distribution and N = 5. 



Using the same substitutions as in ( ITTj) . one obtains 

1 



P(N -2,c = 0,x N ) 



iN-2, 



and thus 



dc 



P(N-l,c,x N ) 



(JV-2)! 



-liV— 2, 



c =o (^-2)! 



d yjV -i/(^-i) ^P(iV - 2, c, ^ 



c=0 



Now P(l,c,x 2 ) = J^ C dy 1 f(y 1 ) and thus ±P(l,c,x 2 )\ c=0 = f{x 2 ). Putting this into 
the recursion relation above and integrating over all i/n weighted by /(?/jv), we obtain 



dc' 



Pn(c) 



c =o (^-2)! 

1 

+ 



dy N f 2 (y N )F N - 2 (y N 



+ 



(N — 3)! 
1 



UN 



dy N f(y N ) I dyWWi)^ -3 ^-!) + • • • 



0! 



dy N f(yN) 



VN 



d-VN-lfiVN- 



1.11 



dyif 2 (yi), 



(44) 



a sum with N terms, the last of which comprises N — 1 nested integrals. Somewhat 
miraculously, as shown in APPENDIX III, this chain of integrals can be collapsed into 
the simple closed form advertised in (T43|) . Figure [5] compares the asymptotic expression 
for P/v(c) derived here with numerical simulations. 



4. Record statistics for large drift 

In section [2] we saw that, although record events in the LDM are generally correlated 
for c > 0, the correlations tend to diminish for large c (figure [2]). This is in some sense 
expected, as for c — > oo both p n (c) and Pn(c) tend to unity, such that the stochastic 
independence relation ([3]) becomes trivially satisfied. Moreover, numerical studies [23] 
suggest that the rate of convergence of the record rate to its limiting value p(c) increases 
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Figure 6. Scaling collapse of Pn{c) as suggested by the asymptotic expression (|45|) 
for a) Laplace density f{x) — e~\ x \/2 and b) Levy-density with /i = 1.3. The ordinate 
is the corresponding expression from Eqs. ([48|) and (|49|) respectively. Note that the 
asymptotic expressions get more accurate for larger N. Inset shows direct plots of 
simulation results (points) versus asymptotic expression (lines). 



with c and for sufficiently large values is to a good accuracy attained from the very 
beginning. Thus for large c fl3]) can be approximated by 

Pn(c) « p(cf = (1 - e(c)) W - 1 « e-C^-W, (45) 

where e(c) is the probability that X n is not a record. For large c, only X n _i has an 
appreciable chance of keeping X n from being a record. Thus 

/■oo 

e(c) « P > X n ] = / ds/* 2 (x). (46) 



Here f* 2 (x) denotes the twofold convolution of the probability density f(x) of the i.i.d. 
part of X n . To quote a few examples: 

/(*) = ^=e^ 2 / 2 =► 6(c) = ierfc(c/2) « -L e -^/* (47) 

V27T 2 Ca/TT 



/(x) = L-l*! e (c) = ie~ c + -e~ c {At 



2 w 2 4 

f(x) = — I e - ikx+lk ^ => e(c) = — / da: / dfc e - ifex - 2|fc|M w 7,^^, (49) 

27T 7_oo 27T 

with 



= gji+jOMM. (50) 

The first two of these examples are from the Gumbel class of extreme value statistics, 
whereas the third example is from the Frechet class [151 02] • The asymptotic expression 
in (147p is from [39], while the one in (149p can straightforwardly be derived from the 
known expression for the large-x asymptotics for f(x), see e.g. [ID]. Note that the large 
c asymptotics for the Weibull class is trivial, because both p n and become identically 
equal to unity once c exceeds the range of support of f(y). Inserting the expressions 
E8f49]) into (145]) and also considering the asymptotics of the exact expression for 
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P/v(c) derived in ( I17p . we see that in the limit of large N and c the behavior of the 
ordering probability is generally of the approximate form 

P N (c) « exp[-JV/JV*(c)], (51) 

where N*(c) ~ e c for the Gumbel and exponential distributions, N*(c) ~ e c2 for the 
Gaussian, and N*(c) ~ c M for the Levy distribution. 



0.7 







numerical simulations B 
2T(1+u)*sin(ji*u/2)/(Jt*u) 



























1 1.2 1.4 1.6 1.8 2 



M 

Figure 7. Check of the expression for 7^ from (TJS]). For N — 1024, the range 
< c < 400 was numerically explored as for the data shown in figure [5] The curves 
obtained this way were then fitted to the form exp (— Nj^c - ^). The value of 7^ 
obtained in this way is shown here for various values of n and compared to the analytic 
expression (l50l) . 

To verify the approximations made in this section, we performed numerical 
simulations, see figure [6] and figure [7J The results indicate that our approach, although 
quite rough and not necessarily well-controlled, does indeed capture the interesting 
regime rather well for sufficiently large N and c. 

5. Conclusions 

In this article we considered the statistics of records and sequences of records of 
random variables with a linear trend as described by (j4j). We numerically explored 
the correlations between record events (cf. figure E]) and analytically investigated the 
record rate p n (c) and the ordering probability Pn(c) in the limiting regimes of small 
and large drift velocities, c C 1 and c> 1 respectively. For the regime of c ~ 0(1), we 
have not found a generally applicable method. Thus the behavior of p n (c) and Pn(c) in 
this regime remains an open problem. 

Specifically, we considered the effect of a small linear drift on distributions of the 
three extreme-value classes. While this effect is varying even within the individual 
classes we still found systematic differences between them. For the Frechet class of 
distributions with power-law tails we found that the coefficient of the leading order 
correction to the record rate decays as I n ~ n~ 1 ^ for large n. This implies a distinction 
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between distributions with and without a finite first moment: For // > 1 the correction 
decays more slowly than the unperturbed record rate 1/n, which implies that the drift 
dominates asymptotically and p n (c) attains a nonzero limit for n — > oo; on the other 
hand, for \x < 1 the drift is asymptotically irrelevant. 

For the considered distributions of the Gumbel class the situation was a bit more 
complicated. For the exponential distribution we found a constant additive correction 
to the record rate, while for generalized Gaussian probability densities / oc e"^' 8 the 
correction term was shown to be of order ln(n) 1- ?, which increases (decreases) with 
n when > 1 (j3 < 1). For the distributions of the Weibull class, the effect of the 
drift is the strongest, and the correction term generally increases as a power law in n. 
Moreover, for highly singular distributions with £ < 1/2 in (1241) . we found indications 
for a non- analytic behavior of p n (c) which will be investigated elsewhere. Generally 
speaking, narrow distributions are very sensitive to drift, while for broad distributions 
with heavy tails the effect is much weaker. We have also pointed out that the behavior 
of the first order correction term I n obtained in this paper precisely matches earlier 
results for the asymptotic record rate p(c) [22] . 

For the probability of a sequence of N consecutive records, we find the following: 
For c C 1, the distribution f{y) of the i.i.d. part of X n enters to leading order in c 
only as a numerical constant dxf 2 (x), see (jUJ), but the iV-dependence is completely 
universal for all distributions for which the integral exists. On the other hand, for c ^> 1 
and N 3> 1, the combination in which c and N enter Pjv(c) depends explicitly on the 
tail of the underlying distribution F. This indicates that somewhere in the regime of 
intermediate c, there is a crossover in the c-dependence of P/v(c) from a highly universal 
to a less universal form. 

The result (j43|) has important implications in the context of adaptive paths of 
evolutionary biology: Recalling that the expected number of accessible paths between 
two genotypes which are N mutations apart is given by N\Pn+i, we see in the presence 
of an arbitrarily small drift this quantity increases with N as cN. Thus even a 
weak systematic fitness gradient dramatically increases the accessibility of mutational 
pathways in the direction of increasing fitness. 
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APPENDIX I - Computation of I n for the Gaussian distribution 

We begin by computing the saddle point y defined by d y g (y) = 0, where the function 
g(y) is given in (|35l) . The saddle point satisfies 

-2y + (n-2) . 6 ' _ /2 = 0. (52) 
J^dy'e V 



Records and sequences of records from random variables with a linear trend 18 



For large n this can only be solved by y ^> 1, which implies that dy'e v z ~ y/2n 
and reduces fl52l) to 



e 2 . (53) 



n-2 

By taking the square on both sides of (1531) one finds that the solution is given in terms 
of the Lambert W-function W (z) [23 EH] as 



\ 



(n-2" 2 



W (54) 



in 



(recall that W (z) is defined implicitly through W(z)e ^ = z). Using ( 15 3 p the function 
g and its second derivative at the saddle point take the form 

g(y) ~-f-2 (55) 



and 



d 2 y g{y)^-2{l + y 2 ). (56) 



It follows that 



4tt V d 2 y g (y) Aire 2 \ 1 + y 2 

Using once more (1531) to replace e~^ we obtain 



n (n-l) I -2tt cfj(ij) _ n(7i-l) r^Z e -if, (57) 



2^ n(n-l) y 2 2^ V? / /n*\ 

n e 2 (n-2pyiTF^ ^ * 2 V VSttJ (58) 

for large n, where we have used the expansion [37] W (z) ~ ln(z) — ln(ln(z)) to 
evaluate f!54|) . This expansion also shows that the leading corrections to the asymptotic 
expression (|58|) are of order ln(ln(n 2 /87r))/ ln(n 2 /87r), which accounts for the relatively 
large deviations from the numerical results seen in figure HI 

APPENDIX II - Generalized Gaussian distributions 

Here we consider probability densities of the form 

f(y) = CffiW (59) 

with (3 > and Cp = [2T(1 + We want to evaluate the integral (13"91 in the saddle 

point approximation. Introducing the function 

g(y) = -2/ + (n - 2) In (c!p £ dy'e^'A , (60) 

the saddle point equation d y g (y) = reads, for large n, 

^V-W*'. (61) 

n — 2 
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The solution can again be expressed in terms of the Lambert-W function. Defining 
rj := 1 — 4 we find 



(62) 



Note that this expression is valid both for ft > 1 (rj > 0) and for ft < 1 (rj < 0), but in 
the latter case the second real branch of W(z) has to be used [37]. Using the asymptotics 
of W(z) we obtain 

y fa (inn — 77 ln(?7 _1 lnn)) 1///? fa (lnn) 1///3 (63) 

for large n. 

With the help of fjBTT) the function g and its second derivative at the saddle point 
become 

g(y) fa -2/ - 2 fa -2/ (64) 

and 

d^ (y) fa -2/3(/5 - 1)^~ 2 - 2ft 2 y 2 ^ 2 fa -2/3 2 ^" 2 (65) 

for large y. Thus, using (!5I)I . we see that e 9( -^ fa e -2 ^ ~ y 2/3 ~ 2 /n 2 , and therefore 
(ignoring all constant prefactors) 



/» ~ n2 \l^lff iy) ~ y"' 1 ~ O™) 1 - 1 ^. (66) 
Appendix III - Proof of an expansion 

In this appendix, we will provide the details on the expansion of J^dxf 2 ^) into the 
terms on the right hand side of (|44|) . The starting point is the relation 

F n (x) / dyf 2 (y) =n / dyf{y)F n ~\y) / d^ 2 (^) 



+ f dzf 2 {z)F n {z) 1 (67) 
J —00 



which can be proved by applying integration by parts to the first term on the right hand 
side. With the identities F n (oo) = 1 and F n {— 00) = 0, one obtains from ( IBTj) 

1 r dzf(z) = ^ r dzfiz) 

n! 



00 



x=— 00 



/ d^/ 2 ^) / dx/ 2 (x) (68) 



711 

1 /*OG j /»00 

- / d^ 2 (z)F"(^) + / d^F"" 1 ^) / dz'/V)- 
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The first term of the sum is already identical to the first term in (144j) for n = N — 2. 
Using (j67p on the inner of the two integrals of the second term of the sum above, one 
obtains 

/OO PZ POO PZ 

dzf(z)F n - 1 (z) / dz'f(z') = / dzf(z) / dz'f{z')F n ~\z') 
-oo J —oo J —oo J —oo 

/OO PZ PZ 1 

dzf(z) / dz'f{z')F n - 2 {z') I dz"f 2 {z"). 
-oo J —oo J —oo 

Dividing by (n — 1)! and putting this back into ( 1681) with n = N — 2, one sees that 
now the first two terms of the sum agree with (1441) . By repeating this procedure on 
the terms that do not yet match and noting that finally F°(z) = 1, one has expanded 
dxf 2 (x) into the RHS of (jSJ), which concludes the proof of f|43l) . 
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